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SEQUENCE LISTING 



Q 
P 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME : Kossmann, Jens 

Butt cher, Volker 
Welsh, Thomas 

(ii) TITLE OF INVENTION: DNA SEQUENCES ENCODING ENZYMES CAPABLE 

OF FACILITATING THE SYNTHESIS OF LINEAR 
ALPHA- 1 , 4 GLUCANS IN PLANTS, FUNGI AND 
MICROORGANISMS 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: James F. Haley, Jr., c/o Fish & Neave 

(B) STREET: 1251 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States of America 

(F) ZIP: 10020 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
{ B) COMPUTER: IBM PC compatible 
(C) OPERATING SYSTEM: PC- DOS /MS-DOS 
1 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DE P 44 17 879.4 

(B) FILING DATE: 18-MAY-1994 



if\ 
* 

a 

M (vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DE P 44 47 388.5 

(B) FILING DATE: 22-DEC-1994 

(vii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: James F. Haley, Jr., Esq. 

(B) REGISTRATION NUMBER: 27,7 94 

(C) DOCKET NUMBER: GFB-1 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 596-9000 

(B) TELEFAX: (212) 596-9090 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2914 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: genomic library in pBlues criptl I SK 

(B) CLONE: pNB2 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 957 . .2867 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAGTTTTGCG TTCCCGAACC GAACGTGATG CTTGAGCCGA ACACCTGTCC GGCAAGGCGG 60 

O CTGACCGCCC CCTTTTGCCC CAT C GACAT C GTAACAATCG GTTTGGTGGC AAGCTCTTTC 12 0 

jjg GCTTTGAGCG TGGCAGAAAG CAAAGTCAGC ACGTCTTCCG CGCTTTGCGG CATCACCGCA 180 

ATTTTGCAGA TGTCCGCGCC GCAGTCCTCC ATCTGTTTCA GACGGCATAC GATTTCTTCT 24 0 

P TGCGGCGGCG TGCGGTGAAA CT CAT GATT G CAGAGCAGGG CGGCGATGCC GTTTTTTTGA 300 

GCATGCGCCA CGGCGCGCCG GACGGCGGTT TCGCCGGAAA AAAGCTCGAT ATCGATAATG 3 60 

j~i TCGGGCAGGC GGCTTTCAAT CAGCGAGTCG AGCAGTTCAA AATAATAATC GTCCGAACAC 42 0 

;?! GGGAACGAGC CGCCTTCGCC ATGCCGTCTG AACGTAAACA GCAGCGGCTT GTCGGGCAGC 4 80 

h ? 

Ifl GCGTCGCGGA CGGTCTGCGT GTGGCGCAAT ACTTCGCCGA TGCTGCCCGC GCATTCCAAA 54 0 

|2 AAATCGGCGC GGAACTCGAC GAT AT C GAAG GGCAGGTTTT TGATTTGGTC AAGTACGGCG 600 

GAAAGT AC G G CGGCATCGCG GGCGACAAGC GGCACGGCGA TTTTGGTGCG TCCGCTTCCG 660 

ATAACGGTGT TTTTGACGGT CAGGCTGGTG TGCATGGCGG TTGTTGCGGC TGAAAGGAAC 72 0 

GGTAAAGACG CAATTATAGC AAAGGCACAG GCAATGTTTC AGACGGCATT TCTGTGCGGC 780 

CGGCTTGATA TGAATCAAGC AGCATCCGCA TATCGGAATG CAGACTTGGC ACAAGCCCTG 840 

TCTTTTCTAG TCAGTCCGCA GTTCTTGCAG TAT GAT T G C A CGACACGCCC T ACAC GGCAT 900 

TTGCAGGATA CGGCGGCAGA CCGCCGGTCG GAAAC T T C AG AATCGGAGCA GGCAT C 956 

ATG TTG ACC CCC ACG CAG CAA GTC GGT TTG ATT TTA CAG TAC CTC AAA 1004 
Met Leu Thr Pro Thr Gin Gin Val Gly Leu lie Leu Gin Tyr Leu Lys 
15 10 15 

ACA CGC ATC TTG GAC ATC TAC ACG CCC GAA CAG CGC GCC GGC ATC GAA 1052 
Thr Arg lie Leu Asp lie Tyr Thr Pro Glu Gin Arg Ala Gly lie Glu 
20 25 30 



AAA TCC GAA GAC TGG CGG CAG TTT TCG CGC CGC ATG GAT ACG CAT TTC 



1100 
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Lys Ser Glu Asp Trp Arg Gin Phe Ser Arg Arg Met Asp Thr His Phe 
35 40 45 

CCC AAA CTG ATG AAC GAA CTC GAC AGC GTG TAC GGC AAC AAC GAA GCC 114 8 

Pro Lys Leu Met Asn Glu Leu Asp Ser Val Tyr Gly Asn Asn Glu Ala 
50 55 60 

CTG CTG CCT ATG CTG GAA ATG CTG CTG GCG CAG GCA TGG CAA AGC TAT 1196 
Leu Leu Pro Met Leu Glu Met Leu Leu Ala Gin Ala Trp Gin Ser Tyr 
65 70 75 80 

TCC CAA CGC AAC TCA TCC TTA AAA GAT ATC GAT ATC GCG CGC GAA AAC 12 44 

Ser Gin Arg Asn Ser Ser Leu Lys Asp lie Asp lie Ala Arg Glu Asn 
85 90 95 

AAC CCC GAT TGG ATT TTG TCC AAC AAA CAA GTC GGC GGC GTG TGC TAC 12 92 

Asn Pro Asp Trp lie Leu Ser Asn Lys Gin Val Gly Gly Val Cys Tyr 
100 105 110 

GTT GAT TTG TTT GCC GGC GAT TTG AAG GGC TTG AAA GAT AAA ATT CCT 134 0 

Val Asp Leu Phe Ala Gly Asp Leu Lys Gly Leu Lys Asp Lys lie Pro 
115 120 125 

TAT TTT CAA GAG CTT GGT TTG ACT TAT CTG CAC CTG ATG CCG CTG TTT 1388 
Tyr Phe Gin Glu Leu Gly Leu Thr Tyr Leu His Leu Met Pro Leu Phe 
130 135 140 

AAA TGC CCT GAA GGC AAA AGC GAC GGC GGC TAT GCG GTC AGC AGC TAC 14 36 

Lys Cys Pro Glu Gly Lys Ser Asp Gly Gly Tyr Ala Val Ser Ser Tyr 
145 150 155 160 

CGC GAT GTC AAT CCG GCA CTG GGC ACA ATA GGC GAC TTG CGC GAA GTC 14 8 4 

Arq Asp Val Asn Pro Ala Leu Gly Thr lie Gly Asp Leu Arg Glu Val 
165 170 175 

ATT GCT GCG CTG CAC GAA GCC GGC ATT TCC GCC GTC GTC GAT TTT ATC 1532 
He Ala Ala Leu His Glu Ala Gly He Ser Ala Val Val Asp Phe He 
180 185 190 

TTC AAC CAC ACC TCC AAC GAA CAC GAA TGG GCG CAA CGC TGC GCC GCC 15 8 0 

Phe Asn His Thr Ser Asn Glu His Glu Trp Ala Gin Arg Cys Ala Ala 
195 200 205 

GGC GAC CCG CTT TTC GAC AAT TTC TAC TAT ATT TTC CCC GAC CGC CGG 162 8 

Gly Asp Pro Leu Phe Asp Asn Phe Tyr Tyr He Phe Pro Asp Arg Arg 
210 215 220 

ATG CCC GAC CAA TAC GAC CGC ACC CTG CGC GAA ATC TTC CCC GAC CAG 167 6 

Met Pro Asp Gin Tyr Asp Arg Thr Leu Arg Glu He Phe Pro Asp Gin 
225 230 235 240 

CAC CCG GGC GGC TTC TCG CAA CTG GAA GAC GGA CGC TGG GTG TGG ACG 172 4 

His Pro Gly Gly Phe Ser Gin Leu Glu Asp Gly Arg Trp Val Trp Thr 
245 250 255 

ACC TTC AAT TCC TTC CAA TGG GAC TTG AAT TAC AGC AAC CCG TGG GTA 1772 
Thr Phe Asn Ser Phe Gin Trp Asp Leu Asn Tyr Ser Asn Pro Trp Val 
260 265 270 
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TTC CGC GCA ATG GCG GGC GAA ATG CTG TTC CTT GCC AAC TTG GGC GTT 1820 
Phe Arg Ala Met Ala Gly Glu Met Leu Phe Leu Ala Asn Leu Gly Val 
275 280 285 

GAC ATC CTG CGT ATG GAT GCG GTT GCC TTT ATT TGG AAA CAA ATG GGG 18 68 

Asp lie Leu Arg Met Asp Ala Val Ala Phe lie Trp Lys Gin Met Gly 
290 295 300 

ACA AGC TGC GAA AAC CTG CCG CAG GCG CAC GCC CTC ATC CGC GCG TTC 1916 
Thr Ser Cys Glu Asn Leu Pro Gin Ala His Ala Leu lie Arg Ala Phe 
305 310 315 320 

AAT GCC GTT ATG CGT ATT GCC GCG CCC GCC GTG TTC TTC AAA TCC GAA 1964 
Asn Ala Val Met Arg lie Ala Ala Pro Ala Val Phe Phe Lys Ser Glu 
325 330 335 

GCC ATC GTC CAC CCC GAC CAA GTC GTC CAA TAC ATC GGG CAG GAC GAA 2012 
Ala lie Val His Pro Asp Gin Val Val Gin Tyr lie Gly Gin Asp Glu 
340 345 350 

TGC CAA ATC GGT TAC AAC CCC CTG CAA ATG GCA TTG TTG TGG AAC ACC 2 060 

Cys Gin lie Gly Tyr Asn Pro Leu Gin Met Ala Leu Leu Trp Asn Thr 
355 360 365 

CTT GCC ACG CGC GAA GTC AAC CTG CTC CAT CAG GCG CTG ACC TAC CGC 2108 
Leu Ala Thr Arg Glu Val Asn Leu Leu His Gin Ala Leu Thr Tyr Arg 
370 375 380 

CAC AAC CTG CCC GAG CAT ACC GCC TGG GTC AAC TAC GTC CGC AGC CAC 2156 
His Asn Leu Pro Glu His Thr Ala Trp Val Asn Tyr Val Arg Ser His 
385 390 395 400 

GAC GAC ATC GGC TGG ACG TTT GCC GAT GAA GAC GCG GCA TAT CTG GGC 2204 
Asp Asp lie Gly Trp Thr Phe Ala Asp Glu Asp Ala Ala Tyr Leu Gly 
405 410 415 

ATA AGC GGC TAC GAC CAC CGC CAA TTC CTC AAC CGC TTC TTC GTC AAC 2252 
lie Ser Gly Tyr Asp His Arg Gin Phe Leu Asn Arg Phe Phe Val Asn 
420 425 430 

CGT TTC GAC GGC AGC TTC GCT CGT GGC GTA CCG TTC CAA TAC AAC CCA 2300 
Arg Phe Asp Gly Ser Phe Ala Arg Gly Val Pro Phe Gin Tyr Asn Pro 
435 440 445 

AGC ACA GGC GAC TGC CGT GTC AGT GGT ACA GCC GCG GCA TTG GTC GGC 2 34 8 

Ser Thr Gly Asp Cys Arg Val Ser Gly Thr Ala Ala Ala Leu Val Gly 
450 455 460 

TTG GCG CAA GAC GAT CCC CAC GCC GTT GAC CGC ATC AAA CTC TTG TAC 239 6 

Leu Ala Gin Asp Asp Pro His Ala Val Asp Arg lie Lys Leu Leu Tyr 
465 470 475 480 

AGC ATT GCT TTG AGT ACC GGC GGT CTG CCG CTG ATT TAC CTA GGC GAC 2444 
Ser lie Ala Leu Ser Thr Gly Gly Leu Pro Leu lie Tyr Leu Gly Asp 
485 490 495 

GAA GTG GGT ACG CTC AAT GAC GAC GAC TGG TCG CAA GAC AGC AAT AAG 24 92 

Glu Val Gly Thr Leu Asn Asp Asp Asp Trp Ser Gin Asp Ser Asn Lys 
500 505 510 



• 
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AGC GAC GAC AGC CGT TGG GCG CAC CGT CCG CGC TAC AAC GAA GCC CTG 2 54 0 

Ser Asp Asp Ser Arg Trp Ala His Arg Pro Arg Tyr Asn Glu Ala Leu 

515 520 525 

TAC GCG CAA CGC AAC GAT CCG TCG ACC GCA GCC GGG CAA ATC TAT CAG 2 58 8 

Tyr Ala Gin Arg Asn Asp Pro Ser Thr Ala Ala Gly Gin lie Tyr Gin 
530 535 540 

GGC TTG CGC CAT ATG ATT GCC GTC CGC CAA AGC AAT CCG CGC TTC GAC 2 636 

Gly Leu Arg His Met lie Ala Val Arg Gin Ser Asn Pro Arg Phe Asp 
545 550 555 560 

GGC GGC AGG CTG GTT ACA TTC AAC ACC AAC AAC AAG CAC ATC ATC GGC 2 68 4 

Gly Gly Arg Leu Val Thr Phe Asn Thr Asn Asn Lys His lie lie Gly 
565 570 575 

TAC ATC CGC AAC AAT GCG CTT TTG GCA TTC GGT AAC TTC AGC GAA TAT 27 32 

Tyr lie Arg Asn Asn Ala Leu Leu Ala Phe Gly Asn Phe Ser Glu Tyr 
580 585 590 

CCG CAA ACC GTT ACC GCG CAT ACC CTG CAA GCC ATG CCC TTC AAG GCG 27 8 0 

Pro Gin Thr Val Thr Ala His Thr Leu Gin Ala Met Pro Phe Lys Ala 

IS 595 600 605 

lfj CAC GAC CTC ATC GGT GGC AAA ACT GTC AGC CTG AAT CAG GAT TTG ACG 2 82 8 

His Asp Leu lie Gly Gly Lys Thr Val Ser Leu Asn Gin Asp Leu Thr 
It 610 615 620 

H Sj CTT CAG CCC TAT CAG GTC ATG TGG CTC GAA ATC GCC TGA CGCACGCTTC 2 877 

^ Leu Gin Pro Tyr Gin Val Met Trp Leu Glu lie Ala * 

p 625 630 635 

If! CCAAATGCCG TCTGAACCGT TTCAGACGGC ATTTGCG 2 914 

S W 



!3 

*0 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Thr Pro Thr Gin Gin Val Gly Leu lie Leu Gin Tyr Leu Lys 
15 10 15 

Thr Arg lie Leu Asp He Tyr Thr Pro Glu Gin Arg Ala Gly He Glu 
20 25 30 

Lys Ser Glu Asp Trp Arg Gin Phe Ser Arg Arg Met Asp Thr His Phe 
35 40 45 

Pro Lys Leu Met Asn Glu Leu Asp Ser Val Tyr Gly Asn Asn Glu Ala 
50 55 60 



Leu Leu Pro Met Leu Glu Met Leu Leu Ala Gin Ala Trp Gin Ser Tyr 



65 70 75 80 

Ser Gin Arg Asn Ser Ser Leu Lys Asp lie Asp He Ala Arg Glu Asn 
85 90 95 

Asn Pro Asp Trp He Leu Ser Asn Lys Gin Val Gly Gly Val Cys Tyr 
100 105 110 

Val Asp Leu Phe Ala Gly Asp Leu Lys Gly Leu Lys Asp Lys He Pro 
115 120 125 

Tyr Phe Gin Glu Leu Gly Leu Thr Tyr Leu His Leu Met Pro Leu Phe 
130 135 140 

Lys Cys Pro Glu Gly Lys Ser Asp Gly Gly Tyr Ala Val Ser Ser Tyr 
145 150 155 160 

Arg Asp Val Asn Pro Ala Leu Gly Thr He Gly Asp Leu Arg Glu Val 
165 170 175 

He Ala Ala Leu His Glu Ala Gly He Ser Ala Val Val Asp Phe He 
180 185 190 

Phe Asn His Thr Ser Asn Glu His Glu Trp Ala Gin Arg Cys Ala Ala 
195 200 205 

Gly Asp Pro Leu Phe Asp Asn Phe Tyr Tyr He Phe Pro Asp Arg Arg 
210 215 220 

Met Pro Asp Gin Tyr Asp Arg Thr Leu Arg Glu He Phe Pro Asp Gin 
225 230 235 240 

His Pro Gly Gly Phe Ser Gin Leu Glu Asp Gly Arg Trp Val Trp Thr 
245 250 255 

Thr Phe Asn Ser Phe Gin Trp Asp Leu Asn Tyr Ser Asn Pro Trp Val 
260 265 270 

Phe Arg Ala Met Ala Gly Glu Met Leu Phe Leu Ala Asn Leu Gly Val 
275 280 285 

Asp He Leu Arg Met Asp Ala Val Ala Phe He Trp Lys Gin Met Gly 
290 295 300 

Thr Ser Cys Glu Asn Leu Pro Gin Ala His Ala Leu He Arg Ala Phe 
305 310 315 320 

Asn Ala Val Met Arg He Ala Ala Pro Ala Val Phe Phe Lys Ser Glu 
325 330 335 

Ala He Val His Pro Asp Gin Val Val Gin Tyr He Gly Gin Asp Glu 
340 345 350 

Cys Gin He Gly Tyr Asn Pro Leu Gin Met Ala Leu Leu Trp Asn Thr 
355 360 365 



Leu Ala Thr Arg Glu Val Asn Leu Leu His Gin Ala Leu Thr Tyr Arg 
370 375 380 
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His Asn Leu Pro Glu His Thr Ala Trp Val Asn Tyr Val Arg Ser His 
385 390 395 400 

Asp Asp lie Gly Trp Thr Phe Ala Asp Glu Asp Ala Ala Tyr Leu Gly 
405 410 415 

lie Ser Gly Tyr Asp His Arg Gin Phe Leu Asn Arg Phe Phe Val Asn 
420 425 430 

Arg Phe Asp Gly Ser Phe Ala Arg Gly Val Pro Phe Gin Tyr Asn Pro 
435 440 445 

Ser Thr Gly Asp Cys Arg Val Ser Gly Thr Ala Ala Ala Leu Val Gly 
450 455 460 

Leu Ala Gin Asp Asp Pro His Ala Val Asp Arg lie Lys Leu Leu Tyr 
465 470 475 480 

Ser lie Ala Leu Ser Thr Gly Gly Leu Pro Leu lie Tyr Leu Gly Asp 
485 490 495 

s,ft Glu Val Gly Thr Leu Asn Asp Asp Asp Trp Ser Gin Asp Ser Asn Lys 
500 505 510 

!^ Ser Asp Asp Ser Arg Trp Ala His Arg Pro Arg Tyr Asn Glu Ala Leu 
j** 515 520 525 

P Tyr Ala Gin Arg Asn Asp Pro Ser Thr Ala Ala Gly Gin lie Tyr Gin 
530 535 540 

]~ Gly Leu Arg His Met lie Ala Val Arg Gin Ser Asn Pro Arg Phe Asp 

545 550 555 560 

III Gly Gly Arg Leu Val Thr Phe Asn Thr Asn Asn Lys His lie lie Gly 

jfl 565 570 575 

3T Tyr lie Arg Asn Asn Ala Leu Leu Ala Phe Gly Asn Phe Ser Glu Tyr 

580 585 590 

Pro Gin Thr Val Thr Ala His Thr Leu Gin Ala Met Pro Phe Lys Ala 
595 600 605 

His Asp Leu lie Gly Gly Lys Thr Val Ser Leu Asn Gin Asp Leu Thr 
610 615 620 

Leu Gin Pro Tyr Gin Val Met Trp Leu Glu lie Ala * 
625 630 635 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCACCATGG GCATCTTGGA CATC 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonuclec 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CTGCCATGGT T C AG AC G G C A TTTGG 



